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Abstract — An efficient scheme for initial ranging has recently 
been proposed by X. Fu et al. in the context of orthogonal 
frequency-division multiple-access (OFDMA) networks based on 
the IEEE 802.16e-2005 standard. The proposed solution aims at 
estimating the power levels and timing offsets of the ranging 
subscriber stations (RSSs) without taking into account the effect 
of possible carrier frequency offsets (CFOs) between the received 
signals and the base station local reference. Motivated by the 
above problem, in the present work we design a novel ranging 
scheme for OFDMA in which the ranging signals are assumed 
to be misaligned both in time and frequency. Our goal is to 
estimate the timing errors and CFOs of each active RSS. Specif- 
ically, CFO estimation is accomplished by resorting to subspace- 
based methods while a least-squares approach is employed for 
timing recovery. Computer simulations are used to assess the 
effectiveness of the proposed solution and to make comparisons 
with existing alternatives. 

I. Introduction 

The main impairment of an orthogonal frequency-division 
multiple-access (OFDMA) network is represented by its re- 
markable sensitivity to timing errors and carrier frequency 
offsets (CFOs) between the uplink signals and the base station 
(BS) local references. For this reason, the IEEE 802.16e-2005 
standard for OFDMA-based wireless metropolitan area net- 
works (WMANs) specifies a synchronization procedure called 
Initial Ranging (IR) where subscriber stations that intend to 
establish a link with the BS can use some dedicated subcarriers 
to transmit their specific ranging codes [1]. Once the BS has 
revealed the presence of ranging subscriber stations (RSSs), it 
has to estimate some fundamental parameters including timing 
errors, CFOs and power levels. 

Two prominent schemes for initial synchronization and 
power control in OFDMA were proposed in [2] and [3]. In 
these works, a long pseudo-noise sequence is transmitted by 
each RSS over the available ranging subcarriers. Timing recov- 
ery is then accomplished on the basis of suitable correlations 
computed in the frequency- and time-domain, respectively. 
The main drawback of these methods is their sensitivity to 
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multipath distortion, which desttoys orthogonality among the 
employed codes and gives rise to multiple access interference 
(MAI). Better results are obtained in [4] by using a set 
of generalized chirp-like (GCL) sequences, which echibits 
increased robustness against the channel selectivity. A differ- 
ent approach to managing the IR process has recently been 
proposed in [5]. Here, the pilot streams transmitted by RSSs 
are spread in the time-domain over adjacent OFDM blocks 
using orthogonal codes. In this way, signals of different RSSs 
can be easily separated at the BS as they remain orthogonal 
after propagating through the channel. Timing information is 
eventually acquired in an iterative fashion by exploiting the 
autocorrelation properties of the received samples induced by 
the use of the cyclic prefix (CP). Unfortunately, this scheme is 
derived under the assumption of perfect frequency alignment 
between the received signals and the BS local reference. 
Actually, the occurrence of residual CFOs results into a loss 
of orthogonality among ranging codes and may lead to severe 
degradations of the system performance in terms of mis- 
detection probability and estimation accuracy. 

In the present work we propose a novel ranging scheme 
for OFDMA systems that is robust to time and frequency 
misalignments. The goal is to estimate timing errors and 
CFOs of all active RSSs. The number of active codes is 
found by resorting to the minimum description length (MDL) 
principle [6] while the multiple signal classification (MUSIC) 
algorithm [7] is employed to detect which codes are actually 
active and to determine their corresponding CFOs. Timing 
estimation is eventually achieved through least-squares (LS) 
methods. Although the proposed solution allows one to esti- 
mate the timing errors of each RSS in a decoupled fashion, 
it may involve huge computational burden in applications 
characterized by large propagation delays. For this reason, 
we also present an alternative scheme derived from ad hoc- 
reasoning which results into substantial computational saving. 
It is worth noting that timing synchronization in OFDMA 
uplink transmissions has received little attention so far. A 
well-established way to handle timing errors is to design the 
CP length large enough to include both the channel delay 
spread and the two-way propagation delay between the BS 



and the user station [8]. This leads to a quasi-synchronous 
system in which timing errors can be viewed as part of the 
channel impulse response (CIR) and are compensated for by 
the channel equalizer. Unfortunately, this approach poses an 
upper limit to the maximum tolerable propagation delay or, 
equivalently, to the maximum distance between the BS and 
the subscriber stations [9]. For this reason, its application to 
a scenario with large cells (as envisioned in next broadband 
wireless networks) is hardly viable. In the latter case, accurate 
knowledge of the timing errors is required in order to align 
the uplink signals to the BS time scale. 

II. System description and signal model 

A. System description 

The investigated OFDMA network employs N subcarriers 
with frequency spacing A/ and indices in the set J = 
{Q, 1, . . . , JV — 1}. Following [5], we denote R the number 
of subchannels reserved for the IR process. Each subchannel 
is divided into Q subbands uniformly spaced over the signal 
bandwidth at a distance (N/Q)Af from each other. A given 
subband is composed of a set of V adjacent subcarriers. The 
subcarrier indices within the qth subband (q = 0, 1, . . . , Q— 1) 
of the rth ranging subchannel (r = 0, 1, . . . , R — 1) are 

(r) r (r) 

collected into a set jq = {i q ,l ; v = 0, 1, . . . , V — 1} with 
entries 

. M qN rN 
■>y> = 1 h V. 

Q QR 



(1) 



The 7th subchannel is thus composed of subcarriers with 
indices taken from = {J®Zq Jq T \ Hence, a total of 

Nr — QVR ranging subcarriers are available in the system 
with indices in the set Jr = U^LqJ'^. The remaining 
N — Nr subcarriers are used for data transmission and are 
assigned to data subscriber stations (DSSs) which have already 
completed their IR process and are assumed to be perfectly 
synchronized to the BS time and frequency scales [5]. 

We denote by M the number of consecutive OFDM blocks 
reserved for IR and assume that each ranging subchannel can 
be accessed at most by M— 1 RSSs. The latter are separated by 
means of specific ranging codes selected in a pseudo-random 
fashion from a predefined set {ci, C2, . . . , cm-i}, with c k = 
[cfc(l), Cfe(2), . . . , c k (M)] T (the superscript T denotes the 
transpose operation). As in [5], we assume that different RSSs 
employ different codes. Without loss of generality, in what 
follows we concentrate on the rth ranging subchannel and 
denote by < M — 1 the number of simultaneously active 
RSSs. Also, to simplify the notation, the subchannel index ^ 
is dropped in all subsequent derivations. 

The waveform transmitted by the fcth RSS (1 < k < K) 
propagates through a multipath channel characterized by an 
impulse response = [h k (0), h k (l), . . . , h k (L — 1)] T of 
length L (in sampling periods). At the BS, the received 
samples are not synchronized with the local references. We 
denote by 9 k the timing error expressed in sampling periods 
while £fe is the frequency offset normalized to the subcarrier 
spacing. As discussed in [8], subscriber stations that intend 



to start the ranging process compute initial frequency and 
timing estimates on the basis of a downlink control signal 
broadcast by the BS. The estimated parameters are then 
employed by each RSS as synchronization references for 
the uplink ranging transmission. This means that during IR 
the CFOs are only due to Doppler shifts and/or estimation 
errors and, in consequence, they are assumed to lie within 
a small fraction of the subcarrier spacing. Furthermore, in 
order to eliminate interblock interference (IBI), we assume 
that during the ranging process the CP length comprises 
Ng > Omnx+L sampling periods, where 6 max is the maximum 
expected timing error [9]. This assumption is not restrictive, 
since in many standardized OFDM systems the initialization 
blocks are usually preceded by long CPs. 

B. Signal model 

We denote by Y TO the QV-dimensional vector that collects 
the DFT outputs corresponding to the considered subchannel 
during the mth OFDM block. Since the DSSs are assumed 
to be perfectly synchronized to the BS references, their sig- 
nals will not contribute to Y m . In contrast, the presence of 
uncompensated CFOs destroys orthogonality among ranging 
signals, thereby leading to some interchannel interference 
(ICI). However, as the subchannels are well separated in the 
frequency domain, we can reasonably neglect interference on 
Y m arising from ranging signals of subchannels other than 
the considered one. Under this assumption, we may write 



K 



Y m = c k (m)e> mu * N * A{u k )S k (6 k ) + n „ 



(2) 



k=l 



where uj k = 2ire k /N, Nt = N + Nq is the duration of the 
cyclically extended block and n TO is a Gaussian vector with 
zero mean and covariance matrix <j 2 \qv (we denote by Ijy 
the identity matrix of order N). Also, we have defined 

A(w fc ) = FV(u> k )F H (3) 
where V(cjfe) accounts for the CFOs and is given by 

V(« fc ) = diag {e^;n = 0, 1, . . . , N - 1} (4) 

while F = [Fq, Ff , . . . , F^_J H (the superscript H denotes 
the Hermitian transposition) with F q {q = 0, 1, . . . , Q — 1) 
denoting al'xJV matrix with entries 

(5) 

Vector Sk(0k) in © can be partitioned as Sfc(^) = 



[Sl(d k ,i 1 ),SZ(6 k ,i 2 ),...,Sl(8 k ,i Q ^ 1 )Y, 
S k (9k,iq) is a U-dimensional vector with elements 



where 



S k {e k ^) = e- Mi ^' N H k {i q<v ), 0<i/<V-l(6) 

while H k (i q ,v) denotes the channel frequency response over 
the i 9i „th subcarrier and is given by 



L-l 

H k {i q , v ) = Y J h k {t)e-^ u ^' N . (7) 

1=0 

From (O we see that 6 k simply appears as a phase shift across 
the DFT outputs. The reason is that the CP length is larger 
than the maximal expected propagation delay, thereby making 
the ranging signals quasi-synchronous. 

In the following sections we show how vectors {Y m }^~J 
can be exploited to compute frequency and timing estimates 
for all active ranging codes. 

III. Estimation of the CFOs 

To simplify the derivation, we assume that the CFOs are 
adequately smaller than the subcarrier spacing, i.e., \u> k \ <C 
1. In such a case, matrices A(ui k ) in (01 can reasonably be 
replaced by Ijv to obtain [8] 

K 

Y m » Yl c k (m)e^ NT S k (9 k ) + n m . (8) 
k =i 

This equation indicates that each CFO results only in a phase 
shift between contiguous OFDM blocks. Collecting the i g i/ th 
DFT output of all M ranging blocks into a vector Y(i q<v ) = 
\X6(iq,v),Yi(i qjV ), . . . ,Y M -i{i q ,u)] T , we may write 

K 

Y (h^) = X! s k(6k, i q ,v)T(wk)ck + n(i g ,„) (9) 
fe=i 

where n{i qv ) is Gaussian distributed with zero-mean and 
covariance matrix (7 2 Im, while T(ui k ) = diag{e- jm "' iA ' T ; m 

0.1 V/ 1} is a diagonal matrix that accounts for the 

phase shifts induced by u> k . 

Inspection of © reveals that, apart from thermal noise, 
vector Y(t q<v ) is a linear combination of the frequency-rotated 
codes {T(uj k )c k }. This means that the signal space is spanned 
by the K vectors {T(uJk)ck} that correspond to the active 
RSSs [10]. Then, if we temporarily assume that the number 
K of active codes is known at the receiver, an estimate of 
uj k (k = 1,2, ... ,K) can be obtained by resorting to the 
MUSIC algorithm [7]. To see how this comes about, we use 
the observations {Y(i q l ,)} to obtain the following sample 
correlation matrix 

V-lQ-l 

^ = 7wEE Y(i q ,„)Y H ( lq ,„). (10) 

Next, based on the forward-backward (FB) approach [10], we 
compute 

Ry = i(R y +JR^J) (11) 

where J is the exchange matrix with l's on the anti-diagonal 
and 0's elsewhere. We denote by Ai > A2 > • • • > Am the 
eigenvalues of Ry arranged in non-increasing order, and by 
{si, S2 5 . . . , sjf} the corresponding eigenvectors. The MUSIC 
algorithm relies on the fact that the eigenvectors associated 



with the AI — K smallest eigenvalues are an estimated basis of 
the noise subspace and, accordingly, they are approximately 
orthogonal to all vectors in the signal space [7]. Hence, an 
estimate of ui k is obtained by minimizing the projection of 
r(w)cfc onto the noise subspace, i.e., 

ui k = argmax{^ fc (w)} , (12) 

with 

= — 77 7. (13) 

It is worth observing that CFO recovery must be accom- 
plished for any active RSS. However, since the BS has no 
prior knowledge as to which codes have been transmitted 
in the considered subchannel, it must evaluate the quantities 
{0)1,0)2, . . . ,&m-i} for the complete set {ci,c 2 , . . . , c M -i}- 
At this stage the problem arises of identifying which 
codes are actually active. The identification algorithm looks 
for the K largest values in the set {^'fc(o)fc)}^f = 7 1 , say 
{^u k (^u k )} k —i> an d declare as active the corresponding 
codes {c Uk } k= i- The CFO estimates are eventually found as 

— [&U1 ) &U2 ! • • • ! &UK ] ' 

At this stage we are left with the problem of estimating the 
parameter K to be used in (fT3l . For this purpose, we adopt 
the MDL approach and obtain [6] 

K = argmin {^(iT)} (14) 

where T(K) is the following metric 

T{K) = -K(2M-k)ln(QV)-QV(M-k)lnp(K) (15) 

with p(K) denoting the ratio between the geometric and 
arithmetic mean of {A^ +1 , A^ +2 , . . . , Am}- Finally, replacing 
A by A in ( fT3l leads to the proposed MUSIC-based frequency 
estimator (MFE) while the described identification algorithm 
is called the MUSIC-based code detector (MCD) 

IV. Estimation of the timing delays 

After code detection and CFO recovery, the BS must acquire 
information about the timing delays of all ranging signals. 
This problem is now addressed by resorting to LS methods. 
In doing so we still assume that the number of active codes 
has been correctly estimated so that K = K. Also, to simplify 
the notation, the indices {u k }^ =1 of the detected codes are 
relabeled following the map u k — ► k for k = 1, 2, . . . , K. 

We begin by reformulating Q in a more compact form. 
For this purpose, we collect the CFOs and timing errors 
in two If -dimensional vectors u = [u>i, 0J2, ■ ■ ■ , ojk] T and 
9 = [9 1, 82, ■ ■ ■ , 0k] T ■ Then, after defining the matrix 
C(u>) = [r(o;i)ci r(o;2)c2 ••• T(u>k)ck] and the vector 
S(0,ig,i,) = [Si(8i, i q ^), 82(62, i q ,v), ■ ■ ■ , Sk{0k, i q ,v)) T , 
we may rewrite (O in the equivalent form 



Y(^) = C(w)S(6>,i q , !/ )+n(i g , !/ ). 



(16) 



Omitting for simplicity the functional dependence of S(9, i qM ) 
on and assuming lj uj, from dT6t the maximum likelihood 
estimate of S(i q v ) is found to be 

S(i g>v ) = [C H (u>)C(u>)]- l C H (u>)Y(i qtU ). (17) 
Substituting ([T6]l into ([T7J yields 



(18) 



where £(i q ,v) is a zero-mean disturbance term. From © and 
(0 it follows that 



and we have denoted by s k (v, £) the Q-point IDFT of the 
sequence {S k (iq <u ); < <z < Q — 1}. In the sequel is 
termed the LS-based timing estimator (LS-TE). 

Once 8 k has been computed from d23t . it is used in ( 1221 to 
estimate the CIR of the fcth RSS as 



T(0 k ) 



e k +L-i 



v=i 



(25) 



It is worth noting that for V — 1 the timing metric (I24l i reduces 
to 



(26) 



L-l 



+ &(*«,./). d9) 



On denoting S fc (^) = S k (i 0iV ), S k (i 1>v ), . . . , S k (i Q _i <v ) , 

g = 0,l,...,Q-l}, we 



and &(9 k , v) = diag{e J « V; 
may rewrite ( fT9l as follows 



S fc (i/) = *(0 fc) i/)F(i/)h fc +£ fc (i/) 



(20) 



where £fc(» = [6(io,i/), 6fe(«i,i/)> 6(*Q-i,f)] T while 
F(^) is a matrix of dimension Q x L with entries [F(i/)] g j = 
e -o^i q ,u for0<(7<Q-land0<^<L-l 

Equation d20b indicates that, apart from the disturbance term 
£ fe (V), Sfc(z/) is only contributed by the /cth RSS, meaning that 
ranging signals have been successfully decoupled at the BS. 
We may thus exploit vectors {Sfc(i') ; v = 0, 1, . . . , V — 1} 
to get LS estimates of (6 k , h k ) separately for each RSS. This 
amounts to minimizing the following objective function with 
respect to (6 k ,h k ) 



v-i 



A k (9 k ,h k ) = £ S*(i/) - *(0 fcl i/)F(i/)h» 



(21) 



For a fixed the minimum of A k (0 k , h k ) is achieved at 
1 

ht = ^^ F* (!/)*" (fl fc , i/)S fc (i/) (22) 

where we have used the identity ~F H (v)Y{v) — Q ■ II- Then, 
substituting (1221 into (f2Tb and minimizing with respect to 6*fc 
yields the timing estimate in the form 



i k — arg max 

o<8 k <e m 



{Wk)} 



(23) 



where T(9 k ) is given by 



'k+L-X 



T&)= E 



v-i 



/AT 



(24) 



and becomes periodic in 9 k with period Q. In such a case, 
the estimate 9 k is affected by an ambiguity of multiples of Q. 
This ambiguity does not represent a serious problem as long 
as Q can be chosen to be greater than 9 max . Unfortunately, in 
some applications this may not be the case. For example, in 
[5] we have Q = 8 while 9 max = 102. 

A. Reduced-complexity timing estimation 

Although separating the RSS signals at the BS considerably 
reduces the system complexity, evaluating T(8 k ) for 9 k = 
0, 1, ... , 9 max may still be computationally demanding, espe- 
cially in applications where # m ax is large. For this reason, we 
now develop an ad-hoc reduced complexity timing estimator 
(RC-TE). 

We begin by decomposing the timing error 9 k into a 
fractional part (3 k , less than Q, plus an integer part which 
is multiple of Q, i.e., 



h = Pk +PkQ 



(27) 



where (3 k £ {0,1, ... ,Q— 1} while p k is an integer parameter 
taken from {0, 1, . . . , P - 1} with P = [9 max /Q\. Omitting 
the details, it is possible to rewrite T(6 k ) as 



where 



r(o k ) = ri(j3 k ) + r 2 (0k,Pk) 

Pk+L-lV-l 

T i(A)= E £i**M)i 2 



(28) 



(29) 



while T2(Pk,p k ) is shown in (f30b at the top of the next page. 

The RC-TE is a suboptimal scheme which, starting from 
(l28l . estimates j3 k and p k in a decoupled fashion. More 
precisely, an estimate of /3 k is first obtained looking for the 
maximum of Ti (/?&), i.e., 



(31) 



/3 k = arg max {Ti(/3 fc )| 

0</3 fc <Q-l I. > 



Next, replacing (3 k with {3 k in the right-hand-side of (1281 1 and 
maximizing with respect to p k , provides an estimate of p k in 
the form 



T 2 (&,pfc) = 2Ke 



Ph+L-l V-2 V-u-1 



E E M".^ 

(f + n,£)e 



- 3 27vn(e+p k Q)/N 



(30) 



arg max 

0<p fc <P- 



i {T 2 (/3 fc ,p fe )} 



(32) 



A further reduction of complexity is possible when V = 
2. Actually, in this case it can be shown that maximizing 
T2(f3k,Pk) is equivalent to maximizing cos(ipk — 2irpkQ /N), 
where 



The estimate of pk is thus obtained in closed-form as 



Pk 



Ncp k 
2ttQ' 



(33) 



(34) 



V. Simulation results 

A. System parameters 

The simulated system is inspired by [5]. The total number 
of subcarriers is N = 1024 while the number of ranging 
subchannels is R = 8. Each subchannel is composed by 
Q = 16 subbands uniformly spaced at a distance N/Q = 64. 
Any subband comprises V = 2 adjacent subcarriers while 
the ranging time-slot includes M = 4 OFDM blocks. The 
ranging codes are taken from a Fourier set of length 4 and 
are randomly selected by the RSSs at each simulation run 
(expect for the sequence [1, 1, 1, 1] T ). The discrete-time CIRs 
have L = 12 channel coefficients. The latter are modeled 
as circularly symmetric and independent Gaussian random 
variables with zero means (Rayleigh fading) and exponential 
power delay profiles, i.e., E{\hk(£)\ 2 } = <r^-exp(— £/12) with 
I = 0,1,..., 11 and d\ chosen such that £{||h fe || 2 } = 1. 
Channels of different users are assumed to be statistically 
independent. They are generated at each new simulation run 
and kept fixed over an entire time-slot. The normalized CFOs 
are uniformly distributed within the interval [—57, f2] and vary 
at each simulation run. We consider a cell radius of 10 km, 
corresponding to a maximum transmission delay # max = 204. 
A CP of length N G = 256 is chosen to avoid IBI. 

B. Performance evaluation 

We begin by investigating the performance of MCD in terms 
of probability of making an incorrect detection, say Pf, This 
parameter is illustrated in Fig. 1 as a function of SNR = 1/cr 2 
under different operating conditions. The number of active 
RSSs varies from 2 to 3 while the maximal frequency offset 
is either 17 = 0.05 or 0.075. Comparisons are made with the 
ranging scheme discussed by Fu, Li and Minn (FLM) in [5], 
where the fcth ranging code is declared active provided that 
the quantity 



a. 10 




Fig. 1. P f vs. SNR for K = 2 or 3 when Q is 0.05 or 0.075. 



1 

M 2 



Q-l V-l 

E E Ky(^)| 



(35) 



exceeds a suitable threshold r/ which is proportional to the 
estimated noise power a 2 . The results of Fig. 1 indicates that 
the proposed scheme performs remarkably better than FLM 
because of its intrinsic robustness against CFOs. As expected, 
the system performance deteriorates for large values of K and 
fl. The reason is that increasing K reduces the dimensionality 
of the noise subspace, which degrades the accuracy of the 
MUSIC estimator. Furthermore, large CFO values result into 
significant ICI which is not accounted for in the signal model 
®, where A(uik) has been replaced by Ijy. 

Fig. 2 illustrates the root mean-square-error (RMSE) of the 
frequency estimates obtained with MFE vs. SNR. Again, we 
see that the system performance deteriorates when K and 

are relatively large. Nevertheless, the accuracy of MFE is 
satisfactory under all investigated conditions. 

The performance of the timing estimators is measured in 
terms of probability of making a timing error, say P(e), as 
defined in [8]. More precisely, an error event is declared 
to occur whenever the estimate 0k gives rise to IBI during 
the data section of the frame. In such a case, the quantity 
6k — 8k + (—Nq.d + L)/2 is larger than zero or smaller 
than —Ng,o + L — 1, where Nq,d is the CP length during 
the data transmission phase. Fig. 3 illustrates P(e) vs. SNR 
as obtained with RC-TE and FLM when N G:D = 64. The 
operating conditions are the same of the previous figures. Since 
the performance of LS-TE is virtually identical to that of RC- 
TE, it is not reported in order not to overcrowd the figure. We 
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Fig. 2. Accuracy of the frequency estimates vs. SNR for K ■■ 
Q is 0.05 or 0.075. 
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Fig. 3. P(e) vs. SNR for K = 2 or 3 when fi is 0.05 or 0.075. 



see that for SNR values larger than 6 dB the proposed scheme 
provides much better results than FLM. 

VI. CONCLUSIONS 

We have derived a novel timing and frequency synchroniza- 
tion scheme for initial ranging in OFDMA-based networks. 
The proposed solution aims at detecting which codes are ac- 
tually being employed and provides timing and CFO estimates 
for all active RSSs. CFO estimation is accomplished by resort- 
ing to the MUSIC algorithm while a LS approach is employed 
for timing recovery. Compared to the timing synchronization 
algorithm discussed in [5], the proposed scheme is more robust 
to frequency misalignments and exhibits improved accuracy. 
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